Breathy or Resonant - A Controlled and Curated Dataset for Phonation Mode Detection in Singing

نویسندگان

Polina Proutskova

Christophe Rhodes

Geraint A. Wiggins

Tim Crawford

چکیده

This paper presents a new reference dataset of sustained, sung vowels with attached labels indicating the phonation mode. The dataset is intended for training computational models for automated phonation mode detection. Four phonation modes are distinguished by Johan Sundberg [15]: breathy, neutral, flow (or resonant) and pressed. The presented dataset consists of ca. 700 recordings of nine vowels from several languages, sung at various pitches in various phonation modes. The recorded sounds were produced by one female singer under controlled conditions, following recommendations by voice acoustics researchers. While datasets on phonation modes in speech exist, such resources for singing are not available. Our dataset closes this gap and offers researchers in various disciplines a reference and a training set. It will be made available online under Creative Commons license. Also, the format of the dataset is extensible. Further content additions and future support for the dataset are planned. 1. MOTIVATION: NARROW, WIDE, BREATHY, RESONANT SINGING IN VARIOUS

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Breathy, Resonant, Pressed - Automatic Detection Of Phonation Mode From Audio Recordings of Singing

In this paper we present an experiment on automatic detection of phonation modes from recordings of sustained sung vowels. We created an open dataset specifically for this experiment, containing recordings of nine vowels from multiple languages, sung by a female singer on all pitches in her vocal range in phonation modes breathy, neutral, flow (resonant) and pressed. The dataset is available un...

متن کامل

Analysis and Classification of Phonation Modes In Singing

Phonation mode is an expressive aspect of the singing voice and can be described using the four categories neutral, breathy, pressed and flow. Previous attempts at automatically classifying the phonation mode on a dataset containing vowels sung by a female professional have been lacking in accuracy or have not sufficiently investigated the characteristic features of the different phonation mode...

متن کامل

Describing different styles of singing: a comparison of a female singer's voice source in "Classical", "Pop", "Jazz" and "Blues".

The voice is apparently used in quite different manners in different styles of singing. Some of these differences concern the voice source, which varies considerably with loudness, pitch, and mode of phonation. We attempt to describe voice source differences between Classical, Pop, Jazz and Blues styles of singing as produced in a triad melody pattern by a professional female singer in soft, mi...

متن کامل

Estimating perceived phonatory pressedness in singing from flow glottograms.

The normalized amplitude quotient (NAQ), defined as the ratio between the peak-to-peak amplitude of the flow pulse and the negative peak amplitude of the differentiated flow glottogram and normalized with respect to period time, has been shown to be related to glottal adduction. Glottal adduction, in turn, affects mode of phonation and hence perceived phonatory pressedness. The relationship bet...

متن کامل

Glottal source modeling for singing voice synthesis

Naturalness of sound quality is essential for singing-voice synthesis. Since 95% of singing is voiced sound (Cook, 1990), the focus of this paper is to improve the naturalness of the vowel tone quality via glottal excitation modeling. We propose to use the LF-model (Fant et al., 1985) for the glottal wave shape in conjunction with pitch-synchronous, amplitude-modulated Gaussian noise, which add...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2012

Breathy or Resonant - A Controlled and Curated Dataset for Phonation Mode Detection in Singing

نویسندگان

چکیده

منابع مشابه

Breathy, Resonant, Pressed - Automatic Detection Of Phonation Mode From Audio Recordings of Singing

Analysis and Classification of Phonation Modes In Singing

Describing different styles of singing: a comparison of a female singer's voice source in "Classical", "Pop", "Jazz" and "Blues".

Estimating perceived phonatory pressedness in singing from flow glottograms.

Glottal source modeling for singing voice synthesis

عنوان ژورنال:

اشتراک گذاری